Persian Computational Morphology: A Unification-Based Approach
نویسنده
چکیده
This report provides a complete descriptive analysis of Persian inflectional morphology from a computational perspective. The parts of speech and the morphemes that appear on them as well as their corresponding morphotactics are presented in detail. The verbal paradigm is also described in this document. Since the morphological analyzer designed for this project uses a unification-based grammar with typed feature structures, the morphological information has been defined in terms of features and values. The report describes the current version of the morphological analyzer used in the Shiraz project and discusses any morphological elements that have not been included in this version, mostly due to the colloquial usage of these morphemes. Sample rules of Samba, the grammar specifying the morphological analyzer, as well as the feature specification for the Persian type definitions module are also described.
منابع مشابه
Unification-Based Persian Morphology
We present a complete formalization of Persian inflectional morphology using a unification-based framework. The morphological analyzer was developed for use in a Persian-English machine translation system; it computes the part of speech categories and returns all syntactically relevant inflectional features for a word. The morphological analyses are represented as feature structures, which can ...
متن کاملFinite-State Morphological Analysis Of Persian
This paper describes a two-level morphological analyzer for Persian using a system based on the Xerox finite state tools. Persian language presents certain challenges to computational analysis: There is a complex verbal conjugation paradigm which includes long-distance morphological dependencies; phonological alternations apply at morpheme boundaries; word and noun phrase boundaries are difficu...
متن کاملA hidden Markov model for Persian part-of-speech tagging
One of the important actions in the processing of languages is part-of-speech tagging. Against of this importance, although numerous models have been presented in different languages but there is few works have been done in Persian language. In this paper, a part-of-speech tagging system on Persian corpus by using hidden Markov model is proposed. Achieving to this goal, the main aspects of Pers...
متن کاملSemantic Morphology
Semantic Morphology addresses the problem of designing the rules needed for mapping between the semantic lexicon and semantic grammar. The text discusses the relation between semantics, lexicon, and morphology in unification-based grammars and builds on the current trends in Computational Semantics to use underspecification and compositionality. The approach to Semantic Morphology advocated her...
متن کاملMorpho-Phonological Modelling in Natural Language Processing
In this paper we propose a computational model for the representation and processing of morpho-phonological phenomena in a natural language, like Modern Greek. We aim at a unified treatment of inflection, compounding, and word-internal phonological changes, in a model that is used for both analysis and generation. After discussing certain difficulties cuase by well-known finitestate approaches,...
متن کامل